Overview

Dataset statistics

Number of variables39
Number of observations135489
Missing cells439255
Missing cells (%)8.3%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory40.3 MiB
Average record size in memory312.0 B

Variable types

CAT20
NUM18
BOOL1

Reproduction

Analysis started2020-05-23 04:46:24.703232
Analysis finished2020-05-23 04:51:27.726078
Duration5 minutes and 3.02 seconds
Versionpandas-profiling v2.8.0
Command linepandas_profiling --config_file config.yaml [YOUR_FILE.csv]
Download configurationconfig.yaml

Warnings

CloseDate has a high cardinality: 5066 distinct values High cardinality
ElementarySchoolName has a high cardinality: 184 distinct values High cardinality
HighSchoolName has a high cardinality: 79 distinct values High cardinality
MiddleSchoolName has a high cardinality: 89 distinct values High cardinality
StreetName has a high cardinality: 15170 distinct values High cardinality
StreetNumber has a high cardinality: 10358 distinct values High cardinality
ArchitecturalStyle has a high cardinality: 264 distinct values High cardinality
TaxLegalDescription has a high cardinality: 61974 distinct values High cardinality
CurrentPrice is highly correlated with ClosePrice and 1 other fieldsHigh correlation
ClosePrice is highly correlated with CurrentPrice and 1 other fieldsHigh correlation
ListPrice is highly correlated with ClosePrice and 1 other fieldsHigh correlation
City is highly correlated with PostalCodeHigh correlation
PostalCode is highly correlated with CityHigh correlation
SchoolDistrict is highly correlated with HighSchoolName and 2 other fieldsHigh correlation
HighSchoolName is highly correlated with SchoolDistrictHigh correlation
MiddleSchoolName is highly correlated with SchoolDistrictHigh correlation
SeniorHighSchoolName is highly correlated with SchoolDistrictHigh correlation
StreetDirSuffix is highly correlated with StreetDirPrefixHigh correlation
StreetDirPrefix is highly correlated with StreetDirSuffixHigh correlation
HighSchoolName has 1548 (1.1%) missing values Missing
Occupancy has 24987 (18.4%) missing values Missing
SchoolDistrict has 7915 (5.8%) missing values Missing
SeniorHighSchoolName has 85203 (62.9%) missing values Missing
StreetDirPrefix has 132091 (97.5%) missing values Missing
StreetDirSuffix has 135005 (99.6%) missing values Missing
StreetSuffix has 13679 (10.1%) missing values Missing
ArchitecturalStyle has 10981 (8.1%) missing values Missing
TaxLegalDescription has 25836 (19.1%) missing values Missing
OriginalListPrice is highly skewed (γ1 = 241.5921536) Skewed
RATIO_ClosePrice_By_ListPrice is highly skewed (γ1 = 368.085808) Skewed
RATIO_ClosePrice_By_OriginalListPrice is highly skewed (γ1 = 141.6519951) Skewed
RATIO_CurrentPrice_By_SQFT is highly skewed (γ1 = 89.49316584) Skewed
YearBuilt is highly skewed (γ1 = 97.23247123) Skewed
df_index has unique values Unique
DOM has 1820 (1.3%) zeros Zeros
ParkingSpacesGarage has 4197 (3.1%) zeros Zeros

Variables

df_index
Real number (ℝ≥0)

UNIQUE

Distinct count135489
Unique (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean107959.31885245297
Minimum0
Maximum213410
Zeros1
Zeros (%)< 0.1%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile11374.4
Q154955
median108487
Q3161734
95-th percentile203277.6
Maximum213410
Range213410
Interquartile range (IQR)106779

Descriptive statistics

Standard deviation61375.38889
Coefficient of variation (CV)0.5685047807
Kurtosis-1.196706121
Mean107959.3189
Median Absolute Deviation (MAD)53383
Skewness-0.02476599036
Sum1.462730015e+10
Variance3766938361
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
81881< 0.1%
 
1361681< 0.1%
 
541921< 0.1%
 
562411< 0.1%
 
521471< 0.1%
 
398651< 0.1%
 
337221< 0.1%
 
480611< 0.1%
 
2037211< 0.1%
 
1975781< 0.1%
 
Other values (135479)135479> 99.9%
 
ValueCountFrequency (%) 
01< 0.1%
 
11< 0.1%
 
21< 0.1%
 
31< 0.1%
 
71< 0.1%
 
ValueCountFrequency (%) 
2134101< 0.1%
 
2134091< 0.1%
 
2134081< 0.1%
 
2134051< 0.1%
 
2134041< 0.1%
 

PostalCode
Categorical

HIGH CORRELATION

Distinct count23
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.0 MiB
75070
27557
75035
16773
75025
13269
75023
12396
75071
11850
Other values (18)
53644
ValueCountFrequency (%) 
750702755720.3%
 
750351677312.4%
 
75025132699.8%
 
75023123969.1%
 
75071118508.7%
 
75093113488.4%
 
7507484646.2%
 
7507582766.1%
 
7506966384.9%
 
7502465584.8%
 
Other values (13)123609.1%
 

Length

Max length5
Median length5
Mean length5
Min length5

BathsTotal
Real number (ℝ≥0)

Distinct count47
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.5546000044284036
Minimum0.0
Maximum9.3
Zeros24
Zeros (%)< 0.1%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile2
Q12
median2.1
Q33.1
95-th percentile4.1
Maximum9.3
Range9.3
Interquartile range (IQR)1.1

Descriptive statistics

Standard deviation0.8042789529
Coefficient of variation (CV)0.3148355717
Kurtosis1.973783054
Mean2.554600004
Median Absolute Deviation (MAD)0.1
Skewness1.206619227
Sum346120.2
Variance0.6468646341
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
24942236.5%
 
2.12925821.6%
 
3.11836813.6%
 
31793513.2%
 
480646.0%
 
4.152713.9%
 
117151.3%
 
1.114671.1%
 
5.19910.7%
 
3.26840.5%
 
Other values (37)23141.7%
 
ValueCountFrequency (%) 
024< 0.1%
 
0.12< 0.1%
 
0.24< 0.1%
 
117151.3%
 
1.114671.1%
 
ValueCountFrequency (%) 
9.32< 0.1%
 
9.21< 0.1%
 
8.41< 0.1%
 
8.31< 0.1%
 
8.21< 0.1%
 

BedsTotal
Real number (ℝ≥0)

Distinct count11
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean3.681221353763036
Minimum0.0
Maximum42.0
Zeros26
Zeros (%)< 0.1%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile3
Q13
median4
Q34
95-th percentile5
Maximum42
Range42
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.7923628337
Coefficient of variation (CV)0.21524455
Kurtosis80.84733718
Mean3.681221354
Median Absolute Deviation (MAD)1
Skewness1.740453791
Sum498765
Variance0.6278388602
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
46222345.9%
 
34994136.9%
 
51661512.3%
 
253694.0%
 
68890.7%
 
13610.3%
 
749< 0.1%
 
026< 0.1%
 
811< 0.1%
 
93< 0.1%
 
ValueCountFrequency (%) 
026< 0.1%
 
13610.3%
 
253694.0%
 
34994136.9%
 
46222345.9%
 
ValueCountFrequency (%) 
422< 0.1%
 
93< 0.1%
 
811< 0.1%
 
749< 0.1%
 
68890.7%
 

City
Categorical

HIGH CORRELATION

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.0 MiB
Plano
60769
McKinney
45033
Frisco
23784
Prosper
 
3373
Fairview
 
2530
ValueCountFrequency (%) 
Plano6076944.9%
 
McKinney4503333.2%
 
Frisco2378417.6%
 
Prosper33732.5%
 
Fairview25301.9%
 

Length

Max length8
Median length6
Mean length6.278472791
Min length5

CloseDate
Categorical

HIGH CARDINALITY

Distinct count5066
Unique (%)3.7%
Missing0
Missing (%)0.0%
Memory size1.0 MiB
4/29/2005
 
176
4/28/2005
 
160
8/28/2009
 
158
2/27/2015
 
150
4/30/2009
 
148
Other values (5061)
134697
ValueCountFrequency (%) 
4/29/20051760.1%
 
4/28/20051600.1%
 
8/28/20091580.1%
 
2/27/20151500.1%
 
4/30/20091480.1%
 
6/30/20051440.1%
 
3/31/20051380.1%
 
6/26/20091280.1%
 
4/15/20051280.1%
 
5/29/20091280.1%
 
Other values (5056)13403198.9%
 

Length

Max length10
Median length9
Mean length8.991681982
Min length8

ClosePrice
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count10003
Unique (%)7.4%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean282178.02747667686
Minimum70.0
Maximum6547500.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum70
5-th percentile118800
Q1171000
median240000
Q3331000
95-th percentile579900
Maximum6547500
Range6547430
Interquartile range (IQR)160000

Descriptive statistics

Standard deviation192518.9748
Coefficient of variation (CV)0.6822606866
Kurtosis57.26893825
Mean282178.0275
Median Absolute Deviation (MAD)75100
Skewness5.023720581
Sum3.823173659e+10
Variance3.706355565e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
25000010960.8%
 
2250009760.7%
 
2750009340.7%
 
3000008830.7%
 
1650008680.6%
 
2650008570.6%
 
2100008540.6%
 
2600008520.6%
 
2150008460.6%
 
1750008450.6%
 
Other values (9993)12647793.3%
 
ValueCountFrequency (%) 
701< 0.1%
 
811< 0.1%
 
1101< 0.1%
 
1651< 0.1%
 
2201< 0.1%
 
ValueCountFrequency (%) 
65475001< 0.1%
 
57200001< 0.1%
 
51500001< 0.1%
 
49000001< 0.1%
 
42750001< 0.1%
 

CurrentPrice
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count10003
Unique (%)7.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean282184.7647171357
Minimum70.0
Maximum6547500.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum70
5-th percentile118800
Q1171000
median240000
Q3331000
95-th percentile579900
Maximum6547500
Range6547430
Interquartile range (IQR)160000

Descriptive statistics

Standard deviation192534.2359
Coefficient of variation (CV)0.6822984795
Kurtosis57.25241326
Mean282184.7647
Median Absolute Deviation (MAD)75100
Skewness5.023170593
Sum3.823293159e+10
Variance3.706943199e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
25000010960.8%
 
2250009760.7%
 
2750009340.7%
 
3000008830.7%
 
1650008680.6%
 
2650008570.6%
 
2100008540.6%
 
2600008520.6%
 
2150008460.6%
 
1750008450.6%
 
Other values (9993)12647893.3%
 
ValueCountFrequency (%) 
701< 0.1%
 
811< 0.1%
 
1101< 0.1%
 
1651< 0.1%
 
2201< 0.1%
 
ValueCountFrequency (%) 
65475001< 0.1%
 
57200001< 0.1%
 
51500001< 0.1%
 
49000001< 0.1%
 
42750001< 0.1%
 

DOM
Real number (ℝ)

ZEROS

Distinct count610
Unique (%)0.5%
Missing23
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean48.92087313421818
Minimum-97.0
Maximum1649.0
Zeros1820
Zeros (%)1.3%
Memory size1.0 MiB

Quantile statistics

Minimum-97
5-th percentile2
Q19
median28
Q367
95-th percentile163
Maximum1649
Range1746
Interquartile range (IQR)58

Descriptive statistics

Standard deviation60.37500673
Coefficient of variation (CV)1.234135919
Kurtosis25.03125538
Mean48.92087313
Median Absolute Deviation (MAD)22
Skewness3.255511113
Sum6627115
Variance3645.141438
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
353033.9%
 
450783.7%
 
543533.2%
 
239032.9%
 
635102.6%
 
732452.4%
 
1027802.1%
 
827192.0%
 
1124511.8%
 
923891.8%
 
Other values (600)9973573.6%
 
ValueCountFrequency (%) 
-971< 0.1%
 
-671< 0.1%
 
-581< 0.1%
 
-412< 0.1%
 
-391< 0.1%
 
ValueCountFrequency (%) 
16492< 0.1%
 
13241< 0.1%
 
12431< 0.1%
 
10911< 0.1%
 
10661< 0.1%
 

ElementarySchoolName
Categorical

HIGH CARDINALITY

Distinct count184
Unique (%)0.1%
Missing219
Missing (%)0.2%
Memory size1.0 MiB
Christie
 
5274
Thomas
 
2856
Glenoaks
 
2734
Johnson
 
2724
Curtsinger
 
2553
Other values (179)
119129
ValueCountFrequency (%) 
Christie52743.9%
 
Thomas28562.1%
 
Glenoaks27342.0%
 
Johnson27242.0%
 
Curtsinger25531.9%
 
Gunstream25171.9%
 
Bennett24851.8%
 
Shawnee23971.8%
 
Wolford23111.7%
 
Spears23041.7%
 
Other values (174)10711579.1%
 

Length

Max length27
Median length7
Mean length7.606477279
Min length3

HighSchoolName
Categorical

HIGH CARDINALITY
HIGH CORRELATION
MISSING

Distinct count79
Unique (%)0.1%
Missing1548
Missing (%)1.1%
Memory size1.0 MiB
Mckinney
 
14013
Jasper
 
13080
Clark
 
12386
Centennial
 
11932
Vines
 
11139
Other values (74)
71391
ValueCountFrequency (%) 
Mckinney1401310.3%
 
Jasper130809.7%
 
Clark123869.1%
 
Centennial119328.8%
 
Vines111398.2%
 
Mckinney Boyd108038.0%
 
Shepton85536.3%
 
Frisco84286.2%
 
Mckinneyno83206.1%
 
Williams73055.4%
 
Other values (69)2798220.7%
 

Length

Max length23
Median length7
Mean length7.67602536
Min length3

AssociationType
Categorical

Distinct count3
Unique (%)< 0.1%
Missing2
Missing (%)< 0.1%
Memory size1.0 MiB
Mandatory
86663
None
41007
Voluntary
 
7817
ValueCountFrequency (%) 
Mandatory8666364.0%
 
None4100730.3%
 
Voluntary78175.8%
 
(Missing)2< 0.1%
 

Length

Max length9
Median length9
Mean length7.48661515
Min length3

ListPrice
Real number (ℝ≥0)

HIGH CORRELATION

Distinct count6110
Unique (%)4.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean288190.4165578017
Minimum1.0
Maximum6750000.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum1
5-th percentile120000
Q1174900
median244500
Q3339000
95-th percentile598000
Maximum6750000
Range6749999
Interquartile range (IQR)164100

Descriptive statistics

Standard deviation203813.1287
Coefficient of variation (CV)0.7072168852
Kurtosis69.71709055
Mean288190.4166
Median Absolute Deviation (MAD)77000
Skewness5.561089978
Sum3.904663135e+10
Variance4.153979145e+10
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
16990016571.2%
 
17990014921.1%
 
15990014911.1%
 
19990014591.1%
 
24990014371.1%
 
18990013721.0%
 
14990013191.0%
 
29990012590.9%
 
13990011840.9%
 
22500011790.9%
 
Other values (6100)12164089.8%
 
ValueCountFrequency (%) 
11< 0.1%
 
19951< 0.1%
 
99001< 0.1%
 
132002< 0.1%
 
135001< 0.1%
 
ValueCountFrequency (%) 
67500001< 0.1%
 
63990002< 0.1%
 
54000001< 0.1%
 
51500001< 0.1%
 
49000002< 0.1%
 

LotSize
Categorical

Distinct count10
Unique (%)< 0.1%
Missing14
Missing (%)< 0.1%
Memory size1.0 MiB
Less Than .5 Acre (not Zero)
119777
.5 Acre to .99 Acre
 
6378
Condo/Townhome Lot
 
3395
Zero Lot
 
2761
1 Acre to 2.99 Acres
 
2548
Other values (5)
 
616
ValueCountFrequency (%) 
Less Than .5 Acre (not Zero)11977788.4%
 
.5 Acre to .99 Acre63784.7%
 
Condo/Townhome Lot33952.5%
 
Zero Lot27612.0%
 
1 Acre to 2.99 Acres25481.9%
 
3 Acres to 4.99 Acres3150.2%
 
5 Acres to 9.99 Acres1620.1%
 
10 Acres to 49.99 Acres1280.1%
 
Over 100 Acres9< 0.1%
 
50 Acres to 100 Acres2< 0.1%
 
(Missing)14< 0.1%
 

Length

Max length28
Median length28
Mean length26.73476814
Min length3

MiddleSchoolName
Categorical

HIGH CARDINALITY
HIGH CORRELATION

Distinct count89
Unique (%)0.1%
Missing700
Missing (%)0.5%
Memory size1.0 MiB
Dowell
 
8747
Evans
 
7438
Carpenter
 
7292
Faubion
 
7181
Renner
 
6675
Other values (84)
97456
ValueCountFrequency (%) 
Dowell87476.5%
 
Evans74385.5%
 
Carpenter72925.4%
 
Faubion71815.3%
 
Renner66754.9%
 
Wester66614.9%
 
Clark66284.9%
 
Haggard63234.7%
 
Roach58734.3%
 
Robinson53083.9%
 
Other values (79)6666349.2%
 

Length

Max length23
Median length6
Mean length7.01513776
Min length3

MLSNumber
Real number (ℝ≥0)

Distinct count125047
Unique (%)92.3%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean11898237.272088509
Minimum9308485.0
Maximum14320678.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum9308485
5-th percentile9715248.2
Q110759410
median11712748
Q313278224
95-th percentile14048282.8
Maximum14320678
Range5012193
Interquartile range (IQR)2518814

Descriptive statistics

Standard deviation1378898.113
Coefficient of variation (CV)0.1158909577
Kurtosis-1.248301438
Mean11898237.27
Median Absolute Deviation (MAD)1310408
Skewness0.146516078
Sum1.61208027e+12
Variance1.901360005e+12
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
102229153< 0.1%
 
102628883< 0.1%
 
102533343< 0.1%
 
102202913< 0.1%
 
102146093< 0.1%
 
102145783< 0.1%
 
102587383< 0.1%
 
102166453< 0.1%
 
102375783< 0.1%
 
102495943< 0.1%
 
Other values (125037)135459> 99.9%
 
ValueCountFrequency (%) 
93084851< 0.1%
 
93254341< 0.1%
 
93268791< 0.1%
 
93776601< 0.1%
 
93885821< 0.1%
 
ValueCountFrequency (%) 
143206781< 0.1%
 
143202201< 0.1%
 
143137271< 0.1%
 
143111131< 0.1%
 
143110281< 0.1%
 

NumberOfDiningAreas
Real number (ℝ≥0)

Distinct count8
Unique (%)< 0.1%
Missing2
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean1.7552975562231063
Minimum0.0
Maximum9.0
Zeros724
Zeros (%)0.5%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile1
Q12
median2
Q32
95-th percentile2
Maximum9
Range9
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.4500100065
Coefficient of variation (CV)0.2563724907
Kurtosis1.263202183
Mean1.755297556
Median Absolute Deviation (MAD)0
Skewness-1.181274207
Sum237820
Variance0.2025090059
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
210229475.5%
 
13210823.7%
 
07240.5%
 
33350.2%
 
419< 0.1%
 
54< 0.1%
 
72< 0.1%
 
91< 0.1%
 
(Missing)2< 0.1%
 
ValueCountFrequency (%) 
07240.5%
 
13210823.7%
 
210229475.5%
 
33350.2%
 
419< 0.1%
 
ValueCountFrequency (%) 
91< 0.1%
 
72< 0.1%
 
54< 0.1%
 
419< 0.1%
 
33350.2%
 

NumberOfLivingAreas
Real number (ℝ≥0)

Distinct count10
Unique (%)< 0.1%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean2.0672015233821446
Minimum0.0
Maximum9.0
Zeros68
Zeros (%)0.1%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q33
95-th percentile4
Maximum9
Range9
Interquartile range (IQR)2

Descriptive statistics

Standard deviation0.9108283203
Coefficient of variation (CV)0.4406093504
Kurtosis1.018902865
Mean2.067201523
Median Absolute Deviation (MAD)1
Skewness0.7455344679
Sum280081
Variance0.8296082291
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
25607341.4%
 
13984729.4%
 
33164423.4%
 
465234.8%
 
510610.8%
 
61890.1%
 
0680.1%
 
752< 0.1%
 
818< 0.1%
 
913< 0.1%
 
(Missing)1< 0.1%
 
ValueCountFrequency (%) 
0680.1%
 
13984729.4%
 
25607341.4%
 
33164423.4%
 
465234.8%
 
ValueCountFrequency (%) 
913< 0.1%
 
818< 0.1%
 
752< 0.1%
 
61890.1%
 
510610.8%
 

NumberOfStories
Real number (ℝ≥0)

Distinct count6
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.5312534596904546
Minimum0.0
Maximum5.0
Zeros260
Zeros (%)0.2%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile1
Q11
median2
Q32
95-th percentile2
Maximum5
Range5
Interquartile range (IQR)1

Descriptive statistics

Standard deviation0.5119606511
Coefficient of variation (CV)0.3343408943
Kurtosis-1.553155593
Mean1.53125346
Median Absolute Deviation (MAD)0
Skewness-0.05682599414
Sum207468
Variance0.2621037083
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
27104752.4%
 
16359546.9%
 
35720.4%
 
02600.2%
 
412< 0.1%
 
53< 0.1%
 
ValueCountFrequency (%) 
02600.2%
 
16359546.9%
 
27104752.4%
 
35720.4%
 
412< 0.1%
 
ValueCountFrequency (%) 
53< 0.1%
 
412< 0.1%
 
35720.4%
 
27104752.4%
 
16359546.9%
 

Occupancy
Categorical

MISSING

Distinct count3
Unique (%)< 0.1%
Missing24987
Missing (%)18.4%
Memory size1.0 MiB
Owner
74955
Vacant
32541
Tenant
 
3006
ValueCountFrequency (%) 
Owner7495555.3%
 
Vacant3254124.0%
 
Tenant30062.2%
 
(Missing)2498718.4%
 

Length

Max length6
Median length5
Mean length4.893519031
Min length3

OriginalListPrice
Real number (ℝ≥0)

SKEWED

Distinct count6113
Unique (%)4.5%
Missing6
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean302765.87255227595
Minimum0.0
Maximum430000000.0
Zeros18
Zeros (%)< 0.1%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile124000
Q1175140.5
median248500
Q3345000
95-th percentile612900
Maximum430000000
Range430000000
Interquartile range (IQR)169859.5

Descriptive statistics

Standard deviation1490388.528
Coefficient of variation (CV)4.922577686
Kurtosis63436.34791
Mean302765.8726
Median Absolute Deviation (MAD)78600
Skewness241.5921536
Sum4.101962871e+10
Variance2.221257965e+12
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
16990015321.1%
 
17990014531.1%
 
24990013641.0%
 
15990013441.0%
 
18990013261.0%
 
19990013241.0%
 
29990011960.9%
 
14990011920.9%
 
22500011770.9%
 
27500011130.8%
 
Other values (6103)12246290.4%
 
ValueCountFrequency (%) 
018< 0.1%
 
110< 0.1%
 
3201< 0.1%
 
10003< 0.1%
 
19901< 0.1%
 
ValueCountFrequency (%) 
4300000001< 0.1%
 
2999009001< 0.1%
 
1265000001< 0.1%
 
349900001< 0.1%
 
279900002< 0.1%
 

ParkingSpacesGarage
Real number (ℝ≥0)

ZEROS

Distinct count10
Unique (%)< 0.1%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean2.1219443788379784
Minimum0.0
Maximum9.0
Zeros4197
Zeros (%)3.1%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile2
Q12
median2
Q32
95-th percentile3
Maximum9
Range9
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.6100273583
Coefficient of variation (CV)0.2874850842
Kurtosis8.5408728
Mean2.121944379
Median Absolute Deviation (MAD)0
Skewness-0.00398990481
Sum287498
Variance0.3721333779
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
210342076.3%
 
32400917.7%
 
041973.1%
 
124541.8%
 
411060.8%
 
51610.1%
 
6890.1%
 
720< 0.1%
 
918< 0.1%
 
814< 0.1%
 
(Missing)1< 0.1%
 
ValueCountFrequency (%) 
041973.1%
 
124541.8%
 
210342076.3%
 
32400917.7%
 
411060.8%
 
ValueCountFrequency (%) 
918< 0.1%
 
814< 0.1%
 
720< 0.1%
 
6890.1%
 
51610.1%
 

PoolYN
Boolean

Distinct count2
Unique (%)< 0.1%
Missing1
Missing (%)< 0.1%
Memory size1.0 MiB
False
102465
True
33023
(Missing)
 
1
ValueCountFrequency (%) 
False10246575.6%
 
True3302324.4%
 
(Missing)1< 0.1%
 

PropertySubType
Categorical

Distinct count5
Unique (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size1.0 MiB
RES-Single Family
128567
RES-Townhouse
 
4256
RES-Condo
 
1692
RES-Half Duplex
 
913
RES-Farm/Ranch
 
61
ValueCountFrequency (%) 
RES-Single Family12856794.9%
 
RES-Townhouse42563.1%
 
RES-Condo16921.2%
 
RES-Half Duplex9130.7%
 
RES-Farm/Ranch61< 0.1%
 

Length

Max length17
Median length17
Mean length16.75961886
Min length9

RATIO_ClosePrice_By_ListPrice
Real number (ℝ≥0)

SKEWED

Distinct count15560
Unique (%)11.5%
Missing1
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean1.7592589004930324
Minimum0.0005200000000000001
Maximum105000.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum0.00052
5-th percentile0.9256435
Q10.96677
median0.98511
Q31
95-th percentile1.03563
Maximum105000
Range104999.9995
Interquartile range (IQR)0.03323

Descriptive statistics

Standard deviation285.25631
Coefficient of variation (CV)162.1457251
Kurtosis135487.4408
Mean1.7592589
Median Absolute Deviation (MAD)0.01498
Skewness368.085808
Sum238358.4699
Variance81371.16239
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
11907114.1%
 
0.980393120.2%
 
0.963080.2%
 
0.971432920.2%
 
0.975612760.2%
 
0.952382750.2%
 
0.977782730.2%
 
0.966672650.2%
 
0.967742630.2%
 
0.982470.2%
 
Other values (15550)11390684.1%
 
ValueCountFrequency (%) 
0.000521< 0.1%
 
0.000541< 0.1%
 
0.000951< 0.1%
 
0.000961< 0.1%
 
0.000971< 0.1%
 
ValueCountFrequency (%) 
1050001< 0.1%
 
150.375941< 0.1%
 
101< 0.1%
 
9.478671< 0.1%
 
7.547171< 0.1%
 

RATIO_ClosePrice_By_OriginalListPrice
Real number (ℝ≥0)

SKEWED

Distinct count20711
Unique (%)15.3%
Missing25
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean15.76715562452017
Minimum0.0005099999999999999
Maximum378000.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum0.00051
5-th percentile0.866329
Q10.94279
median0.97398
Q31
95-th percentile1.035177
Maximum378000
Range377999.9995
Interquartile range (IQR)0.05721

Descriptive statistics

Standard deviation1839.114122
Coefficient of variation (CV)116.6420987
Kurtosis22425.93246
Mean15.76715562
Median Absolute Deviation (MAD)0.02602
Skewness141.6519951
Sum2135881.97
Variance3382340.754
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
1134749.9%
 
0.952382980.2%
 
0.962880.2%
 
0.971432480.2%
 
0.966672360.2%
 
0.933332330.2%
 
0.952300.2%
 
0.967742260.2%
 
0.909092150.2%
 
0.980392110.2%
 
Other values (20701)11980588.4%
 
ValueCountFrequency (%) 
0.000511< 0.1%
 
0.000521< 0.1%
 
0.000921< 0.1%
 
0.000931< 0.1%
 
0.000941< 0.1%
 
ValueCountFrequency (%) 
3780001< 0.1%
 
2760001< 0.1%
 
2250001< 0.1%
 
2225001< 0.1%
 
1875001< 0.1%
 

RATIO_CurrentPrice_By_SQFT
Real number (ℝ≥0)

SKEWED

Distinct count15776
Unique (%)11.6%
Missing36
Missing (%)< 0.1%
Infinite0
Infinite (%)0.0%
Mean104.63622850730512
Minimum0.03
Maximum10000.0
Zeros0
Zeros (%)0.0%
Memory size1.0 MiB

Quantile statistics

Minimum0.03
5-th percentile64.26
Q180.57
median96.74
Q3124.05
95-th percentile164.05
Maximum10000
Range9999.97
Interquartile range (IQR)43.48

Descriptive statistics

Standard deviation43.2295142
Coefficient of variation (CV)0.413140982
Kurtosis20275.61731
Mean104.6362285
Median Absolute Deviation (MAD)19.71
Skewness89.49316584
Sum14173291.06
Variance1868.790898
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
83.33860.1%
 
100850.1%
 
12563< 0.1%
 
90.9160< 0.1%
 
111.1148< 0.1%
 
83.2947< 0.1%
 
85.7145< 0.1%
 
76.9244< 0.1%
 
96.1544< 0.1%
 
92.5943< 0.1%
 
Other values (15766)13488899.6%
 
ValueCountFrequency (%) 
0.031< 0.1%
 
0.041< 0.1%
 
0.071< 0.1%
 
0.082< 0.1%
 
0.11< 0.1%
 
ValueCountFrequency (%) 
100001< 0.1%
 
1217.731< 0.1%
 
10001< 0.1%
 
865.381< 0.1%
 
779.91< 0.1%
 

SchoolDistrict
Categorical

HIGH CORRELATION
MISSING

Distinct count13
Unique (%)< 0.1%
Missing7915
Missing (%)5.8%
Memory size1.0 MiB
Plano ISD
50439
Frisco ISD
35479
McKinney ISD
31829
Prosper ISD
 
6701
Lovejoy ISD
 
1835
Other values (8)
 
1291
ValueCountFrequency (%) 
Plano ISD5043937.2%
 
Frisco ISD3547926.2%
 
McKinney ISD3182923.5%
 
Prosper ISD67014.9%
 
Lovejoy ISD18351.4%
 
Allen ISD7620.6%
 
Lewisville ISD3020.2%
 
Melissa ISD1400.1%
 
Princeton ISD47< 0.1%
 
Celina ISD33< 0.1%
 
Other values (3)7< 0.1%
 
(Missing)79155.8%
 

Length

Max length14
Median length10
Mean length9.757146337
Min length3

SellerType
Categorical

Distinct count2
Unique (%)< 0.1%
Missing978
Missing (%)0.7%
Memory size1.0 MiB
Individual(s)
125131
Lender/REO
 
9380
ValueCountFrequency (%) 
Individual(s)12513192.4%
 
Lender/REO93806.9%
 
(Missing)9780.7%
 

Length

Max length13
Median length13
Mean length12.72012488
Min length3

SeniorHighSchoolName
Categorical

HIGH CORRELATION
MISSING

Distinct count20
Unique (%)< 0.1%
Missing85203
Missing (%)62.9%
Memory size1.0 MiB
Plano Senior
19620
Planoeast
9819
Planowest
9285
Plano West
5424
Plano East
4246
Other values (15)
 
1892
ValueCountFrequency (%) 
Plano Senior1962014.5%
 
Planoeast98197.2%
 
Planowest92856.9%
 
Plano West54244.0%
 
Plano East42463.1%
 
Plano Sr17101.3%
 
Centennial720.1%
 
Frisco65< 0.1%
 
Plano12< 0.1%
 
Jasper7< 0.1%
 
Other values (10)26< 0.1%
 
(Missing)8520362.9%
 

Length

Max length12
Median length3
Mean length5.718198525
Min length3

SqFtTotal
Real number (ℝ≥0)

Distinct count5665
Unique (%)4.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2623.4177239480696
Minimum0.0
Maximum17306.0
Zeros36
Zeros (%)< 0.1%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile1388
Q11892
median2413
Q33186
95-th percentile4406
Maximum17306
Range17306
Interquartile range (IQR)1294

Descriptive statistics

Standard deviation1019.39664
Coefficient of variation (CV)0.3885757996
Kurtosis5.883003564
Mean2623.417724
Median Absolute Deviation (MAD)611
Skewness1.490959604
Sum355444244
Variance1039169.509
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
19302480.2%
 
20982170.2%
 
17852150.2%
 
16592070.2%
 
18682040.2%
 
15352020.1%
 
22971980.1%
 
18481960.1%
 
18601950.1%
 
20041930.1%
 
Other values (5655)13341498.5%
 
ValueCountFrequency (%) 
036< 0.1%
 
491< 0.1%
 
2191< 0.1%
 
5201< 0.1%
 
5251< 0.1%
 
ValueCountFrequency (%) 
173061< 0.1%
 
150421< 0.1%
 
150002< 0.1%
 
149621< 0.1%
 
149191< 0.1%
 

StreetDirPrefix
Categorical

HIGH CORRELATION
MISSING

Distinct count7
Unique (%)0.2%
Missing132091
Missing (%)97.5%
Memory size1.0 MiB
N
1038
W
971
S
830
E
545
NW
 
6
Other values (2)
 
8
ValueCountFrequency (%) 
N10380.8%
 
W9710.7%
 
S8300.6%
 
E5450.4%
 
NW6< 0.1%
 
NE5< 0.1%
 
SW3< 0.1%
 
(Missing)13209197.5%
 

Length

Max length3
Median length3
Mean length2.949944276
Min length1

StreetDirSuffix
Categorical

HIGH CORRELATION
MISSING

Distinct count8
Unique (%)1.7%
Missing135005
Missing (%)99.6%
Memory size1.0 MiB
N
159
S
143
W
103
E
73
NW
 
3
Other values (3)
 
3
ValueCountFrequency (%) 
N1590.1%
 
S1430.1%
 
W1030.1%
 
E730.1%
 
NW3< 0.1%
 
SE1< 0.1%
 
NE1< 0.1%
 
SW1< 0.1%
 
(Missing)13500599.6%
 

Length

Max length3
Median length3
Mean length2.992899793
Min length1

StreetName
Categorical

HIGH CARDINALITY

Distinct count15170
Unique (%)11.2%
Missing0
Missing (%)0.0%
Memory size1.0 MiB
Park
 
288
Virginia Hills
 
274
Preston
 
250
Cross Bend
 
221
Hickory
 
191
Other values (15165)
134265
ValueCountFrequency (%) 
Park2880.2%
 
Virginia Hills2740.2%
 
Preston2500.2%
 
Cross Bend2210.2%
 
Hickory1910.1%
 
Spring Creek1730.1%
 
14th1490.1%
 
Teakwood1460.1%
 
Mason1310.1%
 
Scenic Ranch1290.1%
 
Other values (15160)13353798.6%
 

Length

Max length20
Median length9
Mean length8.909690086
Min length1

StreetNumber
Categorical

HIGH CARDINALITY

Distinct count10358
Unique (%)7.6%
Missing0
Missing (%)0.0%
Memory size1.0 MiB
2601
 
333
575
 
297
3801
 
284
3101
 
283
2204
 
267
Other values (10353)
134025
ValueCountFrequency (%) 
26013330.2%
 
5752970.2%
 
38012840.2%
 
31012830.2%
 
22042670.2%
 
24002480.2%
 
84002460.2%
 
25242310.2%
 
22002280.2%
 
26052130.2%
 
Other values (10348)13285998.1%
 

Length

Max length9
Median length4
Mean length3.991925544
Min length1

StreetSuffix
Categorical

MISSING

Distinct count44
Unique (%)< 0.1%
Missing13679
Missing (%)10.1%
Memory size1.0 MiB
Drive
65067
Lane
20289
Court
 
8613
Trail
 
5696
Street
 
4764
Other values (39)
17381
ValueCountFrequency (%) 
Drive6506748.0%
 
Lane2028915.0%
 
Court86136.4%
 
Trail56964.2%
 
Street47643.5%
 
Road47013.5%
 
Circle33212.5%
 
Way33022.4%
 
Place19451.4%
 
Avenue12640.9%
 
Other values (34)28482.1%
 
(Missing)1367910.1%
 

Length

Max length9
Median length5
Mean length4.671154116
Min length3

ArchitecturalStyle
Categorical

HIGH CARDINALITY
MISSING

Distinct count264
Unique (%)0.2%
Missing10981
Missing (%)8.1%
Memory size1.0 MiB
Traditional
113381
Ranch
 
2338
Contemporary/Modern
 
1804
Ranch, Traditional
 
1243
Mediterranean
 
753
Other values (259)
 
4989
ValueCountFrequency (%) 
Traditional11338183.7%
 
Ranch23381.7%
 
Contemporary/Modern18041.3%
 
Ranch, Traditional12430.9%
 
Mediterranean7530.6%
 
Other5350.4%
 
Contemporary/Modern, Traditional4480.3%
 
French2890.2%
 
Craftsman2870.2%
 
French, Traditional2670.2%
 
Other values (254)31632.3%
 
(Missing)109818.1%
 

Length

Max length98
Median length11
Mean length10.65966979
Min length3

TaxLegalDescription
Categorical

HIGH CARDINALITY
MISSING

Distinct count61974
Unique (%)56.5%
Missing25836
Missing (%)19.1%
Memory size1.0 MiB
SPRING CREEK PARKWAY ESTATES W
 
217
Spring Creek Parkway Estates W
 
137
WELLINGTON AT PRESTON MEADOWS
 
131
LAKES OF PRESTON VINEYARDS VIL
 
113
VILLAGES OF WHITE ROCK CREEK #
 
106
Other values (61969)
108949
ValueCountFrequency (%) 
SPRING CREEK PARKWAY ESTATES W2170.2%
 
Spring Creek Parkway Estates W1370.1%
 
WELLINGTON AT PRESTON MEADOWS1310.1%
 
LAKES OF PRESTON VINEYARDS VIL1130.1%
 
VILLAGES OF WHITE ROCK CREEK #1060.1%
 
Wellington At Preston Meadows910.1%
 
VALOR POINTE - THE RESERVE AT WESTRIDGE PHASE880.1%
 
Winsor Meadows At Westridge #0870.1%
 
BILTMORE SWIM & RACQUET CLUB #770.1%
 
PLANTATION RESORT AUGUSTA FARM750.1%
 
Other values (61964)10853180.1%
 
(Missing)2583619.1%
 

Length

Max length50
Median length30
Mean length27.53075895
Min length1

YearBuilt
Real number (ℝ≥0)

SKEWED

Distinct count142
Unique (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1994.8067739816518
Minimum0.0
Maximum9999.0
Zeros9
Zeros (%)< 0.1%
Memory size1.0 MiB

Quantile statistics

Minimum0
5-th percentile1972
Q11988
median1997
Q32003
95-th percentile2010
Maximum9999
Range9999
Interquartile range (IQR)15

Descriptive statistics

Standard deviation75.0504116
Coefficient of variation (CV)0.03762289791
Kurtosis10534.51134
Mean1994.806774
Median Absolute Deviation (MAD)7
Skewness97.23247123
Sum270274375
Variance5632.564281
Histogram with fixed size bins (bins=10)
ValueCountFrequency (%) 
200564104.7%
 
199862924.6%
 
200161954.6%
 
200061484.5%
 
199960724.5%
 
200659824.4%
 
200459514.4%
 
199756544.2%
 
200353343.9%
 
199652713.9%
 
Other values (132)7618056.2%
 
ValueCountFrequency (%) 
09< 0.1%
 
18573< 0.1%
 
18602< 0.1%
 
18762< 0.1%
 
18771< 0.1%
 
ValueCountFrequency (%) 
999911< 0.1%
 
20201< 0.1%
 
201911< 0.1%
 
201855< 0.1%
 
20171900.1%
 

Interactions

Correlations

Pearson's r

The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.

To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.

Spearman's ρ

The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.

To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.

Kendall's τ

Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.

To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.

Phik (φk)

Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.

Cramér's V (φc)

Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.

Missing values

Sample

First rows

df_indexPostalCodeBathsTotalBedsTotalCityCloseDateClosePriceCurrentPriceDOMElementarySchoolNameHighSchoolNameAssociationTypeListPriceLotSizeMiddleSchoolNameMLSNumberNumberOfDiningAreasNumberOfLivingAreasNumberOfStoriesOccupancyOriginalListPriceParkingSpacesGaragePoolYNPropertySubTypeRATIO_ClosePrice_By_ListPriceRATIO_ClosePrice_By_OriginalListPriceRATIO_CurrentPrice_By_SQFTSchoolDistrictSellerTypeSeniorHighSchoolNameSqFtTotalStreetDirPrefixStreetDirSuffixStreetNameStreetNumberStreetSuffixArchitecturalStyleTaxLegalDescriptionYearBuilt
00750711.02.0McKinney10/3/2014150000.0150000.01649.0John A BakerProsperNone230000.01 Acre to 2.99 AcresRogers11363157.01.01.01.0Owner274900.02.0FalseRES-Farm/Ranch0.652170.54565151.82Prosper ISDIndividual(s)NaN988.0NNaNCuster5841RoadEarly AmericanAbstract A0412 Horn, George, T1930.0
11750713.04.0McKinney10/3/2014400000.0400000.01649.0John A BakerProsperNone400000.03 Acres to 4.99 AcresRogers11363138.02.02.01.0Tenant424900.02.0FalseRES-Single Family1.000000.94140152.09Prosper ISDIndividual(s)NaN2630.0NaNNaNCuster5799RoadTraditionalNaN1965.0
22750345.05.0Frisco7/29/2013555000.0555000.01324.0SpearsFriscoMandatory530200.0Less Than .5 Acre (not Zero)Hunt11281961.02.03.02.0Owner800000.03.0TrueRES-Single Family1.046770.69375119.92Frisco ISDIndividual(s)NaN4628.0NaNNaNLago Vista5507LaneTraditionalStarwood #04 Village #15, Bloc2004.0
33750783.13.0Prosper10/17/20121490000.01490000.01243.0ProsperProsperNone1900000.010 Acres to 49.99 AcresProsper11207426.02.03.02.0NaN2300000.03.0FalseRES-Farm/Ranch0.784210.64783322.93Prosper ISDIndividual(s)NaN4614.0ENaNFrontier2380ParkwayVictorianW.T. Horn Survey, A-3761990.0
47750345.15.0Frisco12/19/20131138000.01138000.01091.0SmithFriscoMandatory1190000.0Less Than .5 Acre (not Zero)Staley11503844.02.03.02.0NaN1350000.03.0TrueRES-Single Family0.956300.84296217.05Frisco ISDIndividual(s)NaN5243.0NaNNaNBriarwood3074LaneTraditionalNaN2006.0
59750935.26.0Plano6/28/2013906500.0906500.01066.0HuffmanSheptonMandatory998500.0Less Than .5 Acre (not Zero)Renner11424721.02.03.02.0Owner1150000.04.0TrueRES-Single Family0.907860.78826148.97Plano ISDIndividual(s)Planowest6085.0NaNNaNCliffview1816DriveContemporary/ModernCliffs Of Gleneagles, Blk A, L1994.0
610750752.13.0Plano3/17/2014116994.0116994.01047.0SaiglingVinesMandatory117000.0Less Than .5 Acre (not Zero)Haggard11566790.01.02.02.0Owner137500.02.0FalseRES-Townhouse0.999950.8508766.40Plano ISDIndividual(s)Plano Senior1762.0NaNNaNDevonshire3225DriveContemporary/Modern, French, TraditionalCobblestone Townhome Community1987.0
716750703.23.0McKinney12/21/2015870000.0870000.0950.0ComstockLibertyMandatory899000.0Condo/Townhome LotScoggins11933761.02.02.02.0Vacant1045000.02.0FalseRES-Condo0.967740.83254203.99Frisco ISDIndividual(s)NaN4265.0NaNNaNSettlement5724WayTraditionalRESIDENCES AT THE GRAND LODGE2012.0
820750782.13.0Prosper9/17/2014535000.0535000.0883.0Judy RuckerProsperNone594900.010 Acres to 49.99 AcresReynolds11756658.02.01.02.0Owner722775.02.0FalseRES-Single Family0.899310.74020248.84Prosper ISDIndividual(s)NaN2150.0WWProsper2076TrailTraditionalABS A0147 COLLIN COUNTY SCHOOL1987.0
923750693.03.0McKinney12/18/2013250000.0250000.0856.0MalvernMckinneyMandatory269000.0Less Than .5 Acre (not Zero)Dr Jack Cockrill11625707.02.02.02.0Vacant250000.02.0FalseRES-Single Family0.929371.0000099.88McKinney ISDIndividual(s)NaN2503.0NaNNaNPreservation700LaneTraditionalChapel Hill #01b, Blk C, Lot 12008.0

Last rows

df_indexPostalCodeBathsTotalBedsTotalCityCloseDateClosePriceCurrentPriceDOMElementarySchoolNameHighSchoolNameAssociationTypeListPriceLotSizeMiddleSchoolNameMLSNumberNumberOfDiningAreasNumberOfLivingAreasNumberOfStoriesOccupancyOriginalListPriceParkingSpacesGaragePoolYNPropertySubTypeRATIO_ClosePrice_By_ListPriceRATIO_ClosePrice_By_OriginalListPriceRATIO_CurrentPrice_By_SQFTSchoolDistrictSellerTypeSeniorHighSchoolNameSqFtTotalStreetDirPrefixStreetDirSuffixStreetNameStreetNumberStreetSuffixArchitecturalStyleTaxLegalDescriptionYearBuilt
135479213398750703.14.0McKinney12/5/2014264900.0264900.0NaNBennettMckinney BoydMandatory264900.0Less Than .5 Acre (not Zero)Dowell13050048.02.03.02.0Vacant264900.02.0FalseRES-Single Family1.000001.0000086.57McKinney ISDLender/REONaN3060.0NaNNaNTrinity2401LaneNaNFOUNTAINVIEW #3 (CMC), BLK E, LOT 392006.0
135480213399750703.14.0McKinney12/5/2014264900.0264900.0NaNBennettMckinney BoydMandatory264900.0Less Than .5 Acre (not Zero)Dowell13050048.02.03.02.0Vacant264900.02.0FalseRES-Single Family1.000001.0000086.57McKinney ISDLender/REONaN3060.0NaNNaNTrinity2401LaneNaNFOUNTAINVIEW #3 (CMC), BLK E, LOT 392006.0
135481213400750712.04.0McKinney12/18/2014199900.0199900.0NaNWilmethMckinneynoMandatory199900.0Less Than .5 Acre (not Zero)Dr Jack Cockrill13056859.01.01.01.0Owner199900.02.0FalseRES-Single Family1.000001.00000107.24McKinney ISDIndividual(s)NaN1864.0NaNNaNJuno Springs7720WayNaNVIRGINIA PARKLANDS (CMC), BLK B, LOT 142005.0
135482213401750712.04.0McKinney12/18/2014199900.0199900.0NaNWilmethMckinneynoMandatory199900.0Less Than .5 Acre (not Zero)Dr Jack Cockrill13056859.01.01.01.0Owner199900.02.0FalseRES-Single Family1.000001.00000107.24McKinney ISDIndividual(s)NaN1864.0NaNNaNJuno Springs7720WayNaNVIRGINIA PARKLANDS (CMC), BLK B, LOT 142005.0
135483213403750752.04.0Plano8/21/2014235000.0235000.0NaNDavisVinesNone235000.0Less Than .5 Acre (not Zero)Haggard13008187.02.01.01.0NaN235000.02.0TrueRES-Single Family1.000001.00000102.71Plano ISDIndividual(s)Plano Senior2288.0NaNNaNCedar Elm2625LaneNaNTIMBERCREEK ESTATES (CPL), BLK F, LOT 291972.0
135484213404750783.05.0Prosper2/16/2015327000.0327000.0NaNCynthia A CockrellProsperMandatory330000.0Less Than .5 Acre (not Zero)Rogers13086217.02.03.02.0NaN330000.02.0FalseRES-Single Family0.990910.99091100.49Prosper ISDIndividual(s)NaN3254.0NaNNaNCrescent Valley1440DriveNaNCEDAR RIDGE ESTATES (CPR), BLK B, LOT 92011.0
135485213405750783.05.0Prosper2/16/2015327000.0327000.0NaNCynthia A CockrellProsperMandatory330000.0Less Than .5 Acre (not Zero)Rogers13086217.02.03.02.0NaN330000.02.0FalseRES-Single Family0.990910.99091100.49Prosper ISDIndividual(s)NaN3254.0NaNNaNCrescent Valley1440DriveNaNCEDAR RIDGE ESTATES (CPR), BLK B, LOT 92011.0
135486213408750932.04.0Plano3/19/2015145000.0145000.0NaNNaNNaNNone155000.0Less Than .5 Acre (not Zero)NaN13107447.01.01.01.0Owner155000.02.0FalseRES-Single Family0.935480.9354879.32Plano ISDIndividual(s)NaN1828.0NaNNaNBirdsong4400LaneNaNPRESTON COVE (CPL), BLK D, LOT 11980.0
135487213409750932.04.0Plano3/19/2015145000.0145000.0NaNNaNNaNNone155000.0Less Than .5 Acre (not Zero)NaN13107447.01.01.01.0Owner155000.02.0FalseRES-Single Family0.935480.9354879.32Plano ISDIndividual(s)NaN1828.0NaNNaNBirdsong4400LaneNaNPRESTON COVE (CPL), BLK D, LOT 11980.0
135488213410750932.03.0Plano10/30/2014242500.0242500.0NaNBarksdaleSheptonMandatory259900.0Less Than .5 Acre (not Zero)Renner13024976.01.01.01.0Vacant259900.02.0FalseRES-Single Family0.933050.93305131.87Plano ISDIndividual(s)Plano West1839.0WNaNPlano5161ParkwayTraditionalOLD SHEPARD PLACE #2 3 & 4 (CPL), BLK C, LOT 91985.0